
A 



J o2 



5 



L 



//<? 



fi>Ap*PAt LAaj6uA6c 



l~>t>/t£> A*Jt> CLASS 



HQ 



n4 



~ir~ 



A*£M*tY 



,-J 



/ 



/oo 



lot. 



106, 



(66 



j/s 



2o2 

1 



Augmented 
training 
corpus 




/ 



Zoo 



Training 






Automatic 




corpus 


r ► 




clustering 







Augment 






Corpus 






utterance 



ZD6 



W-»Class(W) 





Feature 






Selection 






J-NLU 






(LSI) 






Training 





J-NLU 
(LSI) Model 




Lf 



z/6 



J-NLU 
(LSI) 
Classifier 



Best 
categories 



22o 



J/5" 



Set up initial word class mapping 

Compute the mapping perplexity on the training 
corpus 

Do until some stopping criterion is met 
Do for each word w in vocabulary W 
Remove w from class 

Do for all existing classes g: j 
Compute perplexity as if w were moved to g i 
end-do-loop 
Assign w to the class with the lowest perplexity 
end-do-loop 
end-do-loop 

Exit 
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Append: append one corpus to another 
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Join: join the utterances in two corpora 
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Interleave: insert each class after its word 
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Given a parameter p: 

1 . Calculate the IG value of each term 

2. Sort the terms by their IG values into 
descending order 

3. Set the threshold t to the IG value at 
top p percentile of sorted terms 

4. Select terms with IG value > t 
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